Disambiguating Explicit Discourse Connectives without Oracles
نویسندگان
چکیده
Deciding whether a word serves a discourse function in context is a prerequisite for discourse processing, and the performance of this subtask bounds performance on subsequent tasks. Pitler and Nenkova (2009) report 96.29% accuracy (F1 94.19%) relying on features extracted from gold-standard parse trees. This figure is an average over several connectives, some of which are extremely hard to classify. More importantly, performance drops considerably in the absence of an oracle providing gold-standard features. We show that a very simple model using only lexical and predicted part-of-speech features actually performs slightly better than Pitler and Nenkova (2009) and not significantly different from a state-of-the-art model, which combines lexical, part-ofspeech, and parse features.
منابع مشابه
Experiments on Sense Annotations and Sense Disambiguation of Discourse Connectives
Discourse connectives can be analyzed as discourse level predicates which project predicate-argument structure on a par with verbs at the sentence level. The Penn Discourse Treebank (PDTB) reflects this view in its design providing annotation of the discourse connectives and their arguments. Like verbs, discourse connectives have multiple senses. We present a set of manual sense annotation stud...
متن کاملDisambiguating Temporal–Contrastive Discourse Connectives for Machine Translation
Temporal–contrastive discourse connectives (although, while, since, etc.) signal various types of relations between clauses such as temporal, contrast, concession and cause. They are often ambiguous and therefore difficult to translate from one language to another. We discuss several new and translation-oriented experiments for the disambiguation of a specific subset of discourse connectives in...
متن کاملImproving the Inference of Implicit Discourse Relations via Classifying Explicit Discourse Connectives
Discourse relation classification is an important component for automatic discourse parsing and natural language understanding. The performance bottleneck of a discourse parser comes from implicit discourse relations, whose discourse connectives are not overtly present. Explicit discourse connectives can potentially be exploited to collect more training data to collect more data and boost the p...
متن کاملDisambiguating potential connectives
Many discourse connectives also have nondiscourse, or sentential readings. Therefore, for automatic discourse structure analysis, there arises a disambiguation problem even before the question of signalled discourse relation beomes relevant. We focus here on a set of nine German connectives and characterize the task of determining their discourse/sentential reading. Starting from an analysis of...
متن کاملDisambiguating temporal-contrastive connectives for machine translation
Temporal–contrastive discourse connectives (although, while, since, etc.) signal various types of relations between clauses such as temporal, contrast, concession and cause. They are often ambiguous and therefore difficult to translate from one language to another. We discuss several new and translation-oriented experiments for the disambiguation of a specific subset of discourse connectives in...
متن کامل